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Abstract 

The multi SI /PI nuclease AtBFN2 (EC 3.1.30.1) encoded by the Arabidopsis thaliana At1g68290 gene is a glycoprotein that 
digests RNA, ssDNA, and dsDNA. AtBFN2 depends on three zinc ions for cleaving DNA and RNA at 3'-OH to yield 5'- 
nucleotides. In addition, AtBFN2's enzymatic activity is strongly glycan dependent. Plant Zn 2+ -dependent endonucleases 
present a unique fold, and belong to the Phospholipase C (PLQ/P1 nuclease superfamily. In this work, we present the first 
complete, ligand-free, AtBFN2 crystal structure, along with sulfate, phosphate and ssDNA co-crystal structures. With these, 
we were able to provide better insight into the glycan structure and possible enzymatic mechanism. In comparison with 
other nucleases, the AtBFN2/ligand-free and AtBFN2/P0 4 models suggest a similar, previously proposed, catalytic 
mechanism. Our data also confirm that the phosphate and vanadate can inhibit the enzyme activity by occupying the active 
site. More importantly, the AtBFN2/A 5 T structure reveals a novel and conserved secondary binding site, which seems to be 
important for plant Zn 2+ -dependent endonucleases. Based on these findings, we propose a rational ssDNA binding model, 
in which the ssDNA wraps itself around the protein and the attached surface glycan, in turn, reinforces the binding complex. 
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Introduction 

Nucleases catalyze the cleavage of phosphodiester bonds in 
nucleic acids, and can be classified into DNases and RNases [1]. 
The multi SI /PI nucleases have a broad specificity. They can 
cleave both single-stranded DNA (ssDNA) and RNA, but show a 
lower affinity for dsDNA. The SI /PI nuclease class takes its name 
from PI nuclease from Penicillium citrinum [2] and Nuclease SI 
from Aspergillus oryzae [3,4], which are both glycosylated 
endonucleases, and share 51% sequence identity (68% homology). 
SI /PI endonucleases are also present in plants, and are involved 
in programmed cell death (PCD) via hydrolysis of genomic DNA 
[5]. Plant SI /PI endonucleases are divided into two broad types 
according to their metal ion cofactor: Ca 2+ -dependent and Zn 2+ - 
dependent endonucleases (EC 3.1.30.1) [6]. Ca + -dependent 
endonucleases prefer ssDNA over RNA at neutral pH, and may 
be involved in plant immunity [7] . Acidic pH leads to an increase 
in activity of Zn i+ -dependent endonucleases including Arabidopsis 
thaliana multifunctional Nuclease 1 (AtBFNl or ENDOl) [8,9], 
AtBFN2 [10], BEN1 [11], CEL1 [12], ZEN1 [13], ABN1 and 



HBN1 [14]. N-Glycosylation patterns are highly conserved within 
the plant Sl/Pl endonucleases [15], although these sites are not 
consistent with those in PI nuclease [16]. 

AtBFN2 (also called END02) is a 34.5 kDa glycoprotein (the 
predicted mass for protein component alone is 30.5 kDa), encoded 
by the Arabidopsis thaliana Atlg68290 gene. AtBFN2 follows the 
standard glycosylation pattern of plant Sl/Pl endonucleases, with 
three glycosylation sites (Asn91, AsnllO, and Asnl84). While the 
PI nuclease has a higher specificity for RNA over ssDNA, in a 
previous study we showed that the opposite is true for AtBFN2 
[10]. In previous studies we have further shown that the enzymatic 
activity is strongly glycan dependent, which was fully inhibited by 
complete de-glycosylation, after treatment with PNGase F [10]. 

Structurally, Zn 2+ -dependent endonucleases present a unique 
fold, and are classified into the PI /SI nuclease family (pfam ID 
PF02265) [17], itself part of the Phospholipase C (PLC)/P1 
nuclease superfamily (pfam ID CL0368) [18]. The active site 
within the Pl/Sl nuclease family is very well conserved [19,20]. It 
contains nine zinc cluster interacting amino acids (1 tryptophan; 3 
aspartic acids and 5 histidines) [15]; the conservation of this metal 
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cluster extends into the PC-PLC/P1 endonuclease superfamily 
despite low sequence identity [21]. 

Despite extensive efforts to understand how Zn 2+ -dependent 
endonuclease bind and digest ssDNA [16,22], it is still not clear 
how, or even whether, these endonucleases unwind and cleave 
dsDNA. Further, the mechanism by which endonucleases 
specifically interact with ssDNA remains obscure. All co-crystal 
structures show partially digested DNA in the active site [16], or 
crystalline artifacts outside of it [17]. Additionally, though it has 
been previously suggested that glycosylation is relevant for protein 
structural integrity [10], a role for glycans in substrate binding or 
modulation of active site shape may also be important. 

To address these open questions, we extended our previous 
studies based on AtBFN2 [16]. We improved the resolution of our 
earlier sulfate co-crystal [16] from 1.76 A to 1.22 A, thus yielding 
more insight into the glycan structure. We obtained a ligand-free 
AtBFN2 structure, as well as a phosphate co-crystal structure in a 
new unit cell with space group P 1 . We also found that phosphate 
and its transition state analog vanadate were both capable of 
inhibiting enzyme activity. Finally, we obtained a detailed 
ssDNA»AtBFN2 co-crystal structure by soaking the crystals with 
a thiophosphorylated ssDNA analog. In this structure a large 
secondary ssDNA binding site was observed. Based on these 
findings, we propose a rational model for ssDNA»AtBFN2 binding, 
in which the ssDNA wraps itself around the protein and the 
attached surface glycan, in turn, reinforces the complex, acting not 
unlike an elastic band. 

Materials and Methods 

Cloning, overexpression, mutagenesis and purification of 
AtBFN2 

The END02 cDNA was amplified by PCR with primer pairs 
B2-1 IF and B2-12R to include a C-terminal 6 x His-tag (Table SI, 
file SI). The END02-His-tag was cloned into pBI121 binary 
vector to form CaMV35SP::END02-hisOE, and introduced into 
Agrobacterium tumefaciens strain GV3101::pMP90 by electropo- 
ration. Arabidopsis plant transformation was performed with the 
vacuum infiltration method (http://transplant.sinica.edu.tw/ 
english/protocol/trans/l.htm). Seeds were surface-sterilized in 
1% NaOCl and 0.05% Tween-20 solution for 15 minutes. After 
washing with sterile distilled water, seeds were plated in 
kanamycin-selection medium (1/2 Murashige-Skoog medium 
containing 0.11 g 1 1 B5 vitamins, 1% sucrose, 50 mg 1 1 
kanamycin, pH 5.7 and 0.8% Phyto agar). Seedlings were grown 
under long-day conditions (16 h light/8 h dark) at 22±1°C with 
light irradiance of 150 mol m 2 s 1 and relative humidity of 55%. 
Kanamycin-resistant seedlings were planted in soil. After siliques 
wilting, seeds were collected and stored at 4°C. Two week-old 
seedlings (homozygous) were harvested to extract the AtBFN2 
protein for purification. 

AtBFN2 was purified following the method of Ko et al. with 
minor modifications [10]. Soluble proteins were extracted from 
transgenic seedlings with 20 mM Tris buffer (pH 7.5) containing 
an EDTA-free protease inhibitor cocktail (Roche, Germany). After 
centrifuging at 27000 g for 90 min, the supernatant was loaded 
into a Q_ column (QSepharose Fast Follow, GE Healthcare, USA), 
equilibrated and washed with 20 mM Tris buffer. The AtBFN2 
was eluted and recovered by the His-wash buffer (20 mM Tris, 
pH 7.5 with 250 mM NaCl). The fraction was loaded to a Ni 
Sepharose 6 Fast Follow (GE Healthcare) column equilibrated 
with His-wash buffer. After washing with His-wash buffer, 
AtBFN2 was eluted with elution buffer (20 mM Tris, pH 7.5, 
250 mM NaCl and imidazole from 0 to 250 mM). The collected 



fractions (60 to 120 mM imidazole) were applied to 5ml 
Concanavalin A column (Con A, Sigma, USA) washed with 
Con A pre-washing solution (1 M NaCl, 5 mM MgCl 2 , 5 mM 
MnCl 2 , 5 mM CaCl 2 ), and then equilibrated with Con A wash 
buffer (20 mM Tris, pH 7.5, 500 mM NaCl). After washing with 
Con A wash buffer, the AtBFN2 was eluted with Con A elution 
buffer containing 5 to 500 mM glucose and a-D-mannopyrano- 
side. The Con A eluted fractions were concentrated with 10 kDa 
Vivaspain turbo (Sartorius Stedim Biotech, UK) then applied to a 
Sephacryl S-100 High Resolution (GE Healthcare) column in 
buffer (20 mM Tris pH 7.5, 50 mM NaCl). Recombinant 
AtBFN2 protein was detected with 12% SDS-PAGE. Purified 
AtBFN2 was concentrated with 10 kDa Vivaspain turbo for 
crystallization. 

Crystallization, data collection, and structure 
determination 

All crystals were obtained using the sitting-drop vapor diffusion 
method at temperatures ranging from 293 to 298 K. Ligand-free 
AtBFN2 crystals grew within 5 to 7 days in drops prepared by 
mixing 1 (0,1 of the reservoir solution (0.1 M Tris pH 8.5, 0.2 M 
NaOAc, 30% (w/v) PEG 4000) with 1 ul of the protein (6.0 mg/ 
ml). All AtBFN2/S0 4 ; AtBFN2/P0 4 and AtBFN2/A 5 T crystals 
grew within 3 to 4 weeks in drops prepared by mixing 1 |il of the 
reservoir solution (0.1 M Hepes, pH 7.5; 0.2 M LiS04 and 30% 
(w/v) PEG 3350 for AtBFN2/S04 and AtBFN2/A5T; 0.1 M 
Hepes, pH 7.5; 0.2 M Na2HP04 and 30% (w/v) PEG 3350 for 
AtBFN2/P0 4 ) with 1 ul of the protein (8.5 mg/ml). The 
AtBFN2/A 5 T co-crystals included thiophosphorylated nucleo- 
tides: d[A(s)A(s)A(s)A(s)A(s)T], denoted as A 5 T. AtBFN2/S0 4 
crystals were washed in 0.1 M Hepes, pH 7.5 and 30% (w/v) PEG 
3350 for 5 seconds, and then soaked in 0.1 M Hepes, pH 7.5 and 
30% (w/v) PEG 3350 with 0.2 mM A5T for 30 minutes. 
Subsequently, they were flash-cooled to 100 K in a stream of 
cold nitrogen. 

The X-ray diffraction images were collected using the SPXF 
beamline BL13B1; BL13C1 and BL15A1 of the National 
Synchrotron Radiation Research Center (NSRRC) in Taiwan, 
and were then processed using HKL2000 [23]. Prior to use in 
structure determination, 5 % randomly selected reflections were set 
aside as our Rfree reference [24] . The statistics of data collection 
are summarized in Table 1. 

The structure of AtBFN2 crystals were solved by using 
molecular replacement with the PDB ID: 3W52 as a search 
model [16]. All calculations were carried out by using CNS [25] 
and CCP4 [26]. The program COOT was employed for model 
building [27]. The models were further improved by introducing 
polysaccharides, sulfate ions, water molecules, and nucleotides. 
Statistics of the refined structure were produced using REFMAC5 
[28,29] and MolProbity [30] and can be found in Table 1. All 
comparison of the structures were performed using the PyMOL 
Molecular Graphics System, Version 1.5.0.4 Schrodinger, LLC. 

ssDNA activity assay 

ssDNA activity was measured in triplicate according to the 
method of Ko et al. [10]. Purified AtBFN2 protein (0.1 ug) was 
incubated with 1 mg of heat-denatured ssDNA substrate in 200 ul 
of various buffers in each 1.5 ml Eppendorf tube at 37°C, 
10 minutes. All assays were performed in lOOmM Tris pH 7 
buffer, to which different putative inhibitors were added (100 mM 
Na 2 S0 4 ; 100 mM LiCl; 100 mM Li 2 S0 4 ; 1 mM and 10 mM 
Na 3 V0 4 ; 1 mM and 10 mM and 100 mM NaH 2 P0 4 ). In order to 
stop the reaction, we added 50 ul of cold 1 M HC1, followed by a 
10 minute incubation period on ice. After centrifugation at 
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Table 1. Data collection and refinement statistics. Numbers in parentheses are for the highest resolution shells. All positive 
reflections were used in the refinement. 







AtBFN2/ligand-free 


AtBFN2/S0 4 


AtBFN2/P0 4 


AtBFN2/A 5 T 


Data Collection 


Space group 


P1 


P2 1 2,2 1 


PI 


P2,2 1 2 1 


Unit-cell a, b, c (A) 


45.4, 52.7, 61.1 


54.1, 66.8, 99.6 


45.5, 52.9, 60.8 


53.9, 66.2, 98.1 


Unit-cell a, ft y (°) 


71 .3, 78.6, 76.7 


90.0, 90.0, 90.0 


71.3, 78.6, 76.6 


90.0, 90.0, 90.0 


Resolution (A) 


50-2.10 (2.18-2.10) 


50-1.22 (1.26-1.22) 


25-2.0 (2.07-2.00) 


50-1.67 (1.74-1.67) 


Unique reflections 


29732 (2961) 


107958 (10692) 


34563 (3403) 


39430 (3873) 


Redundancy 


3.9 (3.9) 


11.7 (10.5) 


2.0 (2.0) 


9.9 (9.5) 


Completeness (%) 


98.0 (97.1) 


99.9 (99.9) 


96.7 (95.2) 


94.8 (95.7) 


Average l/a (1} 


14.7 (3.7) 


35.3 (3.5) 


9.4 (3.3) 


28.9 (2.2) 


Rmerge (%) 


10.0 (44.8) 


5.6 (45.8) 


6.9 (26.5) 


6.5 (71.0) 


Refinement 


No. of reflections 


28226 (1882) 


102490 (7495) 


31321 (1659) 


36995 (2645) 


Rwork (95% of data) 


19.7 (20.8) 


11.8 (17.4) 


20.2 (27.3) 


14.0 (15.5) 


Rfree (5% of data) 


26.8 (31.2) 


13.3 (18.8) 


24.9 (34.8) 


19.1 (23.7) 


R.m.s.d. bonds (A) 


0.011 


0.010 


0.010 


0.008 


R.m.s.d. angles (°) 


1.564 


1.628 


1.563 


1.47 


Dihedral angles 


Most favored (%) 


94.6 


97.8 


95.67 


96.4 


Allowed (%) 


5.4 


2.2 


4.33 


3.6 


Disallowed (%) 


0.0 


0.0 


0.0 


0.0 


Average B (A 2 )/atoms 


Protein 


26.6/4098 


13.8/2148 


28.2/4087 


25.9/2019 


Carbohydrate 


41 .7/256 


24.7/1 93 


38.6/231 


42.6/1 50 


Ions & ligands 


19.7/6 


21.7/28 


17.9/16 


50.0/71 


Water molecules 


35.0/457 


36.1/493 


36.6/426 


42.8/324 


PDB ID code 


4CWM 


4CXP 


4CXV 


4CXO 



doi:1 0.1 371 /journal.pone.01 05821 .t001 



16000 g for 30 minutes, the supernatant was measured at 260 run 
to determine the amount of acid-soluble DNA using NaroDrop 
2000 (Thermo, USA). 

Results and Discussion 

Extended N-glycan structures 

The protein portion of the sulfate co-crystal structure (4CXP) 
presented in this work is in very good agreement with our 
previously published AtBFN2'S0 4 co-crystal structure (3W52), 
which showed the overall fold of this endonuclease (Fig. 1A) [16]. 
The improved diffraction quality of 4XCP, however, allowed us to 
refine the structure to include almost all putative glycans, as 
described by Ko et at. [10], with only six moieties still missing 
(Fig. 1). As with the 3W52, 4XCP shows three N-glycosilation 
sites, at Asn91, Asnl 10, and Asnl84. 

The glycan bound to Asn91 is especially well defined and we 
can discern both N-acetylglucosamines (GlcNAc), and a further six 
mannose moieties (Man) (Fig. IB). Much like in 3W52, GlcNAc 1 
interacts extensively with the polypeptide, forming hydrogen 
bonds with Arg83, Tyr92, and Gln95. Further, the methyl group 
of GlcNAc 1 is located within a small aliphatic pocket formed by 
Leu57, and Ala88. GlcNAc2, on the other hand, interacts via 7t- 
ring stacking with Trp53. Man 1-5 do not interact with the 
protein. Man6, on the other hand, folds backwards into the 



protein, and interacts with the side chain of Gin 1 08 and with the 
carbonyl group of GlcNAc 2 (Fig. IB). 

While we were able to refine only one additional mannose 
moiety on the glycan bound to Asnl84 (Fig. ID), we obtained a 
much more detailed structure for the one attached to Asnl 10. It 
was thus possible to observe two more Man residues, which extend 
towards the solvent, and do not interact directly with the protein 
(Fig. 1C). A fucose moiety (Fuc) is attached to GlcNAc 1 via an al- 
3 glycoside bond, but it also makes no direct contacts with the 
protein. 

Interestingly, GlcNAc 1 at Asn 1 1 0 interacts specifically with a 
single sulfate molecule, which itself interacts with Arg46 and 
Arg50, and with the aromatic nitrogen of Trp43. The sulfate 
binding pocket seems well prepared to bind ligands which are both 
negatively charged and contain de-localized 7i-orbitals. It is 
tempting to think that this might be a possible accessory binding 
site for ssDNA. 

Structure and activity of the active Zn cluster 

In addition to our improved AtBFN2'SO + co-crystal structure, 
we were also able to obtain a new, ligand-free, structure, which 
was solved from crystals belonging to space-group P 1 (Fig. 1 A). In 
this structure, we found that four water molecules coordinate the 
tri-metallic cluster (Fig. 2 A). Water one (Watl) bridges the gap 
between Znl and Zn3, while waters two and three (Wat2, Wat3) 



PLOS ONE | www.plosone.org 



3 



August 2014 | Volume 9 | Issue 8 | e105821 



ssDNA Binding Site in the Multifunctional Endonuclease AtBFN2 



A 



N110 glycans 



N91 




Figure 1. Overall structure of AtBFN2 and extended glycan structures. A: In the ribbons diagram, AtBFN2-S0 4 (3W52, green) [16], Ligand- 
free AtBFN2_A chain (4CWM, cyan) and AtBFN2/S0 4 (4CXP, magenta) structures are superimposed (3W52 RMSD over 224 common Cot of 0.2 for 
chains A in 4CWM, and over 209 common Cot of 0.078 in 4CXP). Zinc and sulfate ions are shown as gray spheres and yellow sticks. The N-glycan 
structure is labeled for three N-glycosylation sites, 91, 110 and 184 in the same color as their respective main chains (only 4CXP is shown). Protein 
termini are labeled N and C, whereas the disordered region of residues 101-106 is indicated by yellow dots. B-D: Detailed 4CXP N-glycan structures 
and electron density maps. Glycans are shown as sticks and labeled, and the omit map contoured at 0.6a (gray) for glycans. Protein residues 
interacting with the glycans are labeled and shown as sticks. B: N-glycosylation site at Asn91; C: N-glycosylation site at Asn1 10 and D: N-glycosylation 
site at Asn184. 

doi:1 0.1 371 /journal.pone.01 05821 .g001 



exclusively bind Zn2. Finally, water four (Wat4) coordinates Znl. 
Watl is in the predicted required position for an activated 
nucleophile [16], further strengthening our earlier proposal for a 
reaction mechanism (Fig. 2B). In the S0 4 co-crystal structure, 
however, only Wat3 remains (Fig. 2C). 

Accordingly, crystals grown in conditions where lithium sulfate 
was substituted by sodium phosphate, have a phosphate anion 
displacing all coordinating waters, except Wat3 (Fig. 2D). 

Endonuclease activity inhibition assays confirmed Watl dis- 
placement by phosphate and/or sulfate anions. Indeed, phosphate 
inhibited DNA degradation by 40% at a 10 mM concentration, 
and by almost 85% at a 100 mM concentration (Fig. 3). Sulfate, 
however, did not affect enzymatic activity at the tested concen- 
trations, suggesting a much lower affinity for sulfate than for 
phosphate, as had been hinted by previous experiments with PC- 
PLC [31]. To confirm the effects of phosphate we tested the well- 
known phosphatase inhibitor vanadate, which is also a phosphate 
analog [32] (Fig. 3). As expected, vanadate affected endonuclease 



activity at lower concentrations than phosphate, achieving 80% 
inhibition at just 1 mM concentration. 

While these results clearly indicate that phosphate, and 
molecules similar to phosphate, have an inhibitory effect on 
AtBFN2, it is still not clear what the mechanism of inhibition is. 
Since phosphodiester bonds are the main substrate of AtBFN2, it 
is possible that phosphate acts via competitive inhibition. On the 
other hand, in all published mechanisms for endonuclease activity, 
the nucleic acid phosphate backbone displaces Wat2, Wat3 and 
Wat4, but not Watl, which is necessary for catalysis [16,17]. 
Watl, on the other hand, is only displaced by the cleaved product. 

Furthermore, in Endo IV, an unrelated endonuclease which 
nevertheless contains an extremely similar Zn tri-metallic binding 
pocket [33], mutagenesis studies produced a variant with a 
phosphate molecule in the binding pocket, resulting in an inactive 
protein. Garcin et al. concluded that the phosphate coordination 
in the mutant was not functional, and could be one of the factors 
leading to total protein inactivation, mimicking the cleaved 
product [33]. This all leads us to question the competitive 
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Figure 2. The coordination spheres of the tri-metallic Zn cluster with different ligands. The tri-metallic zinc cluster, shown as gray spheres 
in all stereo figures, is embedded in a central pocket of the enzyme. A: The ligand-free AtBFN2 (4CWM, green) active site structure shows four water 
molecules bound separately to the Zn cluster. Water one (Wat1) bridges the gap between Znl and Zn3, whereas waters two, three and four (Wat2, 
Wat3, and Wat4) exclusively bind a single Zn atom each. B: Schematic view of the binding pocket (top), and proposed activated nucleophile 
mechanism (bottom). The covalent bonds are depicted by solid lines and the non-covalent interactions by dashed lines. The arrows denote the paths 
of electrons during the catalysis. The phosphate is connected to the preceding R, and the succeeding R 2 nucleosides by the 03' and the 05' atoms, 
respectively. When Watl is activated by D45, it attacks the phosphate, which is being stabilized by the other Zn atoms, and K48. As a result, the bond 
with the 3' nucleoside is broken. C: AtBFN2/S0 4 (4CXP, magenta) reveal a sulfate anion bound within the active site, displacing Wat1, Wat2 and Wat4. 
D: AtBFN2/P0 4 (4CXV, cyan), with a phosphate anion in the same position as the sulfate was in C. 
doi:1 0.1 371 /journal.pone.01 05821 .g002 



inhibition hypothesis. We propose, therefore, that phosphate 
inhibition might not be competitive, as it would not interfere with 
ssDNA binding, instead acting by disrupting the active site native 
coordination sphere. 

The ssDNA«AtBFN2 complex structure reveals a 
secondary, glycan dependent ssDNA binding site 

By soaking our crystals in a 0.2 mM solution of the thio-ssDNA 
analog A 5 T (sequence: d[A(s)A(s)A(s)A(s)A(s)T]), we managed to 
obtain the first ssDNA»AtBFN2 co-crystal structure (4CXO, 
Table 1). Here we observed two distinct binding pockets. A single 
deoxythymidine monophosphate residue was bound to the active 
site, in much the same way as has been described for PI nuclease 
[17]. We were unable to refine any further nucleotides at this site, 
indicating that the rest of the hexanucleotide was probably highly 
flexible. The thymine moiety was stacked against Tyr59 and 
lodged within a pocket comprising the peptide backbones of 
Hisl30, Tysl37, and Glyl38, and the side-chains of Tyr59, Asn61, 



and Teul29 (Fig. 4A). Asn61 bound the thymine carbonyl oxygen 
4 via a hydrogen bond. The deoxyribose was bound to Asnl40 at 
the ring oxygen, while the 3'-end was tightly coordinated to Zn2, 
displacing the Wat3, which was present in the other three 
structures. Under these conditions, we also found a single sulfate 
anion within the binding pocket, which had displaced the other 
waters, including catalytically relevant Watl. 

The secondary binding site is completely novel, and much 
larger than the active site pocket. In it, elements from both the 
protein and the N-glycan at Asn9 1 participate in the binding of a 
dinucleotide composed of one adenosine, and one thymidine 
(Fig. 4B). As with the active site, a large part of the ssDNA 
molecule could not be refined. However, since thymidine was 
present in both binding sites, it follows that the two binding sites 
are occupied by distinct molecules. At the 5'-end of the single 
stranded dinucleotide, the adenosine is bound by an extensive 
network of hydrogen bonds and hydrophobic interactions, with 
Tyr70 stacking with the purine moiety. Further, the amide 



140 -r 



120 - 



100 - 



80 - 



03 

Z 

» 60 

w 



1 40 

CD 
DC 



20 -- 




100mM 10mM 1mM 10mM 1mM 100mM 100mM 100mM 100mM 
NaH2P04 NaH2P04 NaH2P04 Na3V04 Na3V04 U2SO4 LiCI Na2S04 Tris buffer 



Figure 3. The inhibitory effect of phosphate, and the phosphate analog on AtBFN2. Normalized AtBFN2 nuclease activity was assayed 
with ssDNA in the presence of various additives. Both phosphate and vanadate inhibit AtBFN2, while the others do not. Normalization was calculated 
against the standard conditions (100 mM Tris buffer) corresponding to 277.91 ng/ul digested ssDNA. 
doi:1 0.1 371 /journal.pone.01 05821 .g003 
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B 




C 7071 8182 87 9091 94 

AtBFN2 YQYNRD//GEKGRCVAGA I YNYTTQL//S 

BEN1 FSYARD//GNKNVCVVGA I NNYTAAL//S 

ABN1 YE YSRD//GL EDMCVDGA I QNFTSQL//- 

AtBFNI YEYSRD//GLKDMCVDGA I QNFTSQL//- 

ZEN1 FDYTRD//GMVDMCVAGA I KNFTSQL//- 

CEL1 FDYQRD//GGKDMCVAGA I QNFTSQL//- 

HBN1 FDYSRD//GLENMCVAGA I QNFTSQL//- 

TBN1 FDYERD//GVKDMCVAGA I QNFTTQL//- 

P1 nuclease VD YE RD// CS I SA I AN YTQRV// - 

BcPLC HFYDPD// I PFAKQAKETGAKYFKLA//- 

Consensus F D YS RD//GL KDMC VAGA I QNFTSQL//- 



109 201 

QYNL//- TKK 
PFNP//RSKT 
RHNM//- I DL 
RYNM//- NDL 
RYNM//- DD I 
RYNM//- DD I 
RYNM//- DDL 
RYNM//- GN V 

- - - - //N I S E 

- - QA// K 

RYNM//- DD * 



207 

T//C P D I Y 
T//CAEKY 
I//CPHKY 
I//CPHKY 
S//CVNKY 
S//CANKY 
A//C I NKF 
F//CVNKF 
P//TATRW 
p/AKQDY 
I//C* NKY 



Figure 4. ssDNA binding sites. Two nucleotide binding sites were revealed in the AtBFN2/A 5 T complex structure. A: nucleotide binding site in the 
active site with 5'-TMP and interacting protein residues. B: secondary nucleotide binding site with dinucleotide 5'-AMP-TMP, interacting protein 
residues and glycans. C: Alignment of the conserved residues in the secondary binding site. 
doi:1 0.1 371 /journal.pone.01 05821 .g004 



PLOS ONE | www.plosone.org 



7 



August 2014 | Volume 9 | Issue 8 | e105821 



ssDNA Binding Site in the Multifunctional Endonuclease AtBFN2 




N184 



N184 



ssDNA 



Figure 5. The ssDNA binding pockets and wrapped ssDNA binding model. Top: DelPhi electrostatic surface ranging from -3 (deep red) to 3 
(deep blue). Zn is colored gray, and was not used in the electrostatic potential calculations. The model was generated by combining the well-defined 
glycan structures of 4CXP (green) with the ssDNA ligands of 4CXO (orange). The protein is presented from the active site side, on the left, and rotated 
by 180 degrees, on the right. Bottom: Schematic representation of the above model. In total, we identified two experimental binding pockets (BP1 
and BP3), and we propose two more (BP2 and BP4). BP1 is the active site, in which the ssDNA hydrolysis takes place. BP2 is lined by Trp43, Arg46, 
Arg50, and GlcNAd from N-glycan at position 1 10. BP3 is the large secondary ssDNA binding site, which was determined in our crystal structure. It is 
described in greater detail in Fig. 4 and is well conserved in our alignment. BP4 is a positive charge-rich extension of BP3, which in addition contains 
three tryptophans and one tyrosine (Fig. S2). We propose that the ssDNA will wrap itself around the AtBFN2 molecule via BP2, BP3, and BP4, with 
further stabilization provided by the N-glycans (green filaments in our figure). The reaction will then take place in BP1. 
doi:1 0.1 371 /journal.pone.01 05821 .g005 



belonging to Asn71 coordinates the aromatic nitrogen 7 and the 
primary amine group on the adenine ring. The deoxyribose 
portion of the adenosine interacts via a Tt-ring interaction with 
Tyr90, while the phosphate backbone attracts the positive side- 
chain of Lys20 1 . 

The thymidine, on the other hand, is bound within a relatively 
hydrophobic pocket lined by Gly87 at a-helix 7, with glycosylation 
site 91 in the direct vicinity. Indeed, GlcNAcl at Asn91 interacts 
via its acetyl group with the 3'-end of the thymidine deoxyribose, 
which also interacts with the backbone of Lys81. At the same time, 
the Asn91 side-chain, and the GlcNAcl ring stack the thymine 
moiety. Finally, a hydrophobic contact between Tyrl09 and the 
thymine aromatic ring further stabilizes the glycoprotein-N- 
glycan-nucleotide complex. 



A wrapped ssDNA binding model 

The secondary binding site possibly reveals a new and crucial 
role for glycosylation at Asn91, since GlcNAcl is an important 
partner in binding the dinucleotide. A reasonable hypothesis 
would be that a longer ssDNA strand would be capable of 
interaction with further glycans. Thus, we propose that N- 
glycosylation at Asn91 has not only a role in maintaining 
structural integrity, but also in substrate binding. 

While it was only possible to observe two nucleotides from our 
short DNA sequence within the secondary binding pocket, the 
relative position of these, in comparison with the adenosine in the 
active site, is suggestive of a long, snaking DNA molecule wrapped 
around the surface of AtBFN2 (Fig. 5). Based on DelPhi 
electrostatic calculations [34], we propose that there are two 
further ssDNA binding pockets (pockets two and four in Fig. 5). 
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Pocket two is occupied in the sulfate co-crystal structure by sulfate 
(Fig. 1A), suggesting that it should be possible for the ssDNA 
backbone to be bound in a similar fashion. On the other hand, 
pocket four, an extension of pocket three, is positively charged, 
and contains several aromatic residues (Figure 5, and Figure SI in 
File SI). 

The physiological role of the Tyr site on the PI nuclease 
(Tyrl44 and Tyrl55, Figure SI in File SI) has not yet been 
confirmed, and there is still doubt as to its relevance outside the 
crystal environment, since the key amino-acids are poorly 
conserved [17]. Therefore we sought to evaluate the degree of 
conservation, in comparison to the Tyr site, of our experimental 
and proposed secondary binding sites by performing an alignment 
based on several plant Zn 2+ -dependent nucleases, and the more 
distantly related PI nuclease, and BcPLC (Fig. 4C and Figure SI 
in File SI). The majority of the amino-acids involved in the 
experimental secondary binding site (pocket three) are well 
conserved within the plant Zn 2+ -dependent nucleases, and the 
ones with a lesser degree of conservation are replaced by residues 
capable of similar interactions. For example, Asn71, was 
conservatively replaced by serine in the consensus sequence 
(Fig. 4C). On the other hand, Lys201 is most often replaced by 
glutamate, which is incapable of interacting with the phosphate 
backbone. However, glutamate side-chains are highly flexible, 
resulting in low repulsion against the nucleic acid binding. 
Additionally, there may be alternative pathways by which the 
binding pocket may attain a similar geometry. For example, the 
tomato multifunctional endonuclease TBN 1 presents a Gin instead 
of Tyr at position 90 (Fig. 4C). The TBN1 structure (3SNG) [22] 
also comprises an N-terminal extension (lacking in AtBFN2) of a 
helix 1 1 with an exposed phenylalanine. Thus, it is possible that 
the side-chains of Gin and Phe are able to structurally compensate 
for the absence of Tyr90 (Figure S2 in File SI). Furthermore, the 
computed binding sites (pockets two and four) exhibit high 
sequence similarity, with conserved amino acid substitutions, e.g. 
Arg to Lys (Figure SI in File SI). Considering the high levels of 
sequence and structure conservation in the proposed ssDNA 
secondary binding sites, it is highly likely that they have some 
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